Packt | Scalable Data Analysis in Python with Dask [FCO] GloDLS



Details
Name: Packt | Scalable Data Analysis in Python with Dask [FCO] GloDLS
Description:


By: Mohammed Kashif
Released: 30 May 2019 (New Release!)
Torrent Contains: 46 Files, 10 Folders
Course Source: https://www.packtpub.com/web-development/scalable-data-analysis-python-dask-video

Build high-performance, distributed, and parallel applications in Dask

Video Details

ISBN 9781789808926
Course Length 3 hours 31 minutes

Table of Contents

• Getting Started with Dask
• Understanding Dask Arrays
• Parallelizing Python Code with Dask
• Understanding Dask Dataframes
• Exploring Dask Bags
• Distributed Computing with Dask
• Advanced Dask Features
• Machine Learning with Dask

Learn

• Understand the concept of blocked algorithms and how Dask leverages them to work with large data
• Implement various examples using Dask Arrays, Bags, and DataFrames for efficient parallel computing
• Combine Dask with existing Python packages such as NumPy and Pandas
• See how Dask works under the hood and the various built-in algorithms it has to offer
• Leverage the power of Dask in a distributed setting and explore its various schedulers
• Implement an end-to-end Machine Learning pipeline in a distributed setting using Dask and scikit-learn
• Use Dask Arrays, Bags, and DataFrames for parallel and larger-than-memory (out-of-core) computations
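The idea behind the blocked algorithms mentioned in the list above can be sketched without Dask at all. The following is a minimal illustration in plain Python, not Dask's implementation: the helper names `iter_blocks` and `blocked_mean` are hypothetical, and the block size is an arbitrary assumption. It computes a mean from per-block partial sums, so only one block needs to be held in memory at a time.

```python
from typing import Iterable, Iterator, List


def iter_blocks(values: Iterable[float], block_size: int) -> Iterator[List[float]]:
    """Yield consecutive blocks holding at most block_size items each."""
    block: List[float] = []
    for value in values:
        block.append(value)
        if len(block) == block_size:
            yield block
            block = []
    if block:
        # Emit the final, possibly shorter, block.
        yield block


def blocked_mean(values: Iterable[float], block_size: int = 1000) -> float:
    """Combine per-block sums and counts into a single mean."""
    total = 0.0
    count = 0
    for block in iter_blocks(values, block_size):
        total += sum(block)   # partial result for this block only
        count += len(block)
    if count == 0:
        raise ValueError("cannot take the mean of an empty input")
    return total / count


# Only one block of 50,000 values is materialized at any moment.
result = blocked_mean(range(1, 1_000_001), block_size=50_000)
print(result)
```

Because each block's partial sum is independent of the others, the per-block work could also run in parallel, which is the property a library like Dask exploits.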

About


Data analysts, Machine Learning professionals, and data scientists often use tools such as Pandas, Scikit-Learn, and NumPy for data analysis on their personal computer. However, when they want to apply their analyses to larger datasets, these tools fail to scale beyond a single machine, and so the analyst is forced to rewrite their computation.

If you work on big data and you’re using Pandas, you know you can end up waiting up to a whole minute for a simple average of a series. And that’s just for a couple of million rows!

In this course, you’ll learn to scale your data analysis. First, you will execute distributed data science projects, from data ingestion through data manipulation to visualization, using Dask. Then, you will explore the Dask framework. After that, you will see how Dask can be used with other common Python tools such as NumPy, Pandas, matplotlib, Scikit-learn, and more.
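As a rough illustration of how Dask sits on top of NumPy, the sketch below wraps an in-memory NumPy array in a chunked Dask array and computes its mean. It is not taken from the course material; the array size and the chunk size of 100,000 are arbitrary assumptions, and it requires both `numpy` and `dask` to be installed.

```python
import numpy as np
import dask.array as da

# An ordinary NumPy array; in practice the data might be too large for this.
x_np = np.arange(1_000_000, dtype="float64")

# Wrap it in a Dask array split into chunks of 100,000 elements each.
x = da.from_array(x_np, chunks=100_000)

# Building the expression is lazy: nothing is computed yet.
lazy_mean = x.mean()

# compute() runs the per-chunk work and combines the partial results.
result = lazy_mean.compute()
print(x.numblocks, result)
```

The Dask array mirrors a subset of the NumPy interface, so `x.mean()` reads the same as the NumPy call; the difference is that work is deferred until `compute()` is called.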

You’ll work with large datasets, performing exploratory data analysis to investigate them and report your findings. You’ll learn by applying data analysis principles and different statistical techniques to the same massive datasets across several systems at once.

Throughout the course, we’ll go over the various techniques, modules, and features that Dask has to offer. Finally, you’ll learn to use its unique offering for machine learning, using the Dask-ML package. You’ll also start using parallel processing in your data tasks on your own system without moving to the distributed environment.
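Parallel processing on a single machine, without a distributed cluster, might look like the following sketch using `dask.delayed` with the local threaded scheduler. The functions `load` and `summarize` are hypothetical stand-ins for real work (for example, reading and aggregating one file each), not code from the course.

```python
from dask import delayed


@delayed
def load(part: int) -> list:
    # Hypothetical stand-in for reading one file or partition.
    return list(range(part * 10, (part + 1) * 10))


@delayed
def summarize(rows: list) -> int:
    # Hypothetical per-partition aggregation.
    return sum(rows)


# Calling delayed functions only builds a task graph; no work runs yet.
partials = [summarize(load(part)) for part in range(4)]
total = delayed(sum)(partials)

# Execute the graph on local threads; independent tasks may run in parallel.
result = total.compute(scheduler="threads")
print(result)
```

The four `load`/`summarize` chains do not depend on one another, so the scheduler is free to run them concurrently before the final `sum` combines them.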

All the code files and related files are uploaded on GitHub at this link: https://github.com/PacktPublishing/-Scalable-Data-Analysis-in-Python-with-Dask

Style and Approach

This hands-on course covers all the important components of Dask (arrays, bags, data frames, schedulers, and the Futures API) to parallelize your existing Python code and perform computations in a distributed setting. This course is designed with minimum theory and maximum practical implementation, followed by step-by-step instructions to get you up and running.

Features:

• Leverage the power of parallel computing using dask.delayed
• Get complete exposure to using Dask to handle large data in a distributed setting
• Learn how to do machine learning by combining scikit-learn and Dask in a distributed setting

Author

Mohammed Kashif

Mohammed Kashif works as a Data Scientist at Nineleaps, India, dealing mostly with graph data analysis. Prior to this, he worked as a Python developer at Qualcomm. He completed his Master's degree in computer science at IIIT Delhi, specializing in data engineering. His areas of interest include recommender systems, NLP, and graph analytics. In his spare time, he likes to answer questions on StackOverflow and help other people debug their way out of misery. He is also an experienced teaching assistant with a demonstrated history of working in the higher-education industry.



Category: Tutorials
Language: English
Total size: 1,004.76 MB
Info Hash: 1DA8D6BA45354D7DF5BB65C8E3B79F59A057C4E6
Added by: Prom3th3uS (Super Administrator, Movie Pirate, VIP)
Date added: 2019-08-06 22:59:00
Torrent status: Verified





